Picture for Zhangquan Chen

Zhangquan Chen

Do LLMs Build World Models From Text? A Multilingual Diagnostic of Spatial Reasoning

Add code
May 27, 2026
Viaarxiv icon

LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence

Add code
May 25, 2026
Viaarxiv icon

SkillGenBench: Benchmarking Skill Generation Pipelines for LLM Agents

Add code
May 18, 2026
Viaarxiv icon

Meow-Omni 1: A Multimodal Large Language Model for Feline Ethology

Add code
May 09, 2026
Viaarxiv icon

4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding

Add code
May 07, 2026
Viaarxiv icon

SpaMEM: Benchmarking Dynamic Spatial Reasoning via Perception-Memory Integration in Embodied Environments

Add code
Apr 24, 2026
Viaarxiv icon

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Add code
Apr 02, 2026
Viaarxiv icon

Joint Geometric and Trajectory Consistency Learning for One-Step Real-World Super-Resolution

Add code
Feb 27, 2026
Viaarxiv icon

Unveiling Implicit Advantage Symmetry: Why GRPO Struggles with Exploration and Difficulty Adaptation

Add code
Feb 05, 2026
Viaarxiv icon

OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention

Add code
Feb 05, 2026
Viaarxiv icon